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Amendments to the Claims 



1. (currently amended)A computerized system for augmenting data from a 
source database with data from a reference database, to generate an 
5 augmented database that can be used for predictive modeling, 

comprising: 

a source database comprising structured data; 
a reference database having reference data; 

a locator component configured to use the structured data to locate 
10 reference data in the reference database suitable for association 

with the source database; 
an analyzer component configured to process the reference data into a set 

of descriptors and associating the descriptors to the source data to 

form an augmented database; 
15 a predictive modeling component configured to classify behavior with the 

augmented database; an4 
a data mining component configured to conduct searches of data in the 

augmented databas e; and 
a display for displaying the search results . 

20 

2. (original) The computerized system of Claim 1, wherein the source 
database contains financial transaction data. 

3. (original) The computerized system of Claim 1, wherein the source 
25 database contains telephone call detail records, and wherein the 

reference database contains business indices and telephone directories 
augmented by public information on merchants and service providers. 

4. (original) The computerized system of Claim 1, wherein the source 
30 database contains investment transactions and the reference database 
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contains public information regarding companies, mutual funds and/or 
other investment interests. 

5. (original) The computerized system of Claim 1, wherein the source 
5 database contains insurance transactions, and wherein the reference 

database contains information regarding insurance products, claims 
and/or insurance evaluations. 

6. (original) The computerized system of Claim 1, wherein the source 
10 database contains product inventories, and wherein the reference 

database contains information describing products. 

7. (original) The computerized system of Claim 1, wherein the source 
database contains Internet browser view transactions, and wherein the 

15 reference database contains the Internet pages of the browser view 

transactions. 

8. (original) The computerized system of Claim 1, wherein the source 
database contains retail transactions at an individual product level, and 

20 wherein the reference database contains product information from 

catalogs. 

9. (original) The computerized system of Claim 2, wherein the structured 
data comprises at least a name or identifier corresponding to a merchant, 

25 product and/or service. 

"10. (original) The computerized system of Claim 1, wherein the reference 
database contains data in an unstructured format. 

30 11. (original) The computerized system of Claim 10, wherein the 
reference database comprises a public database such as the Internet. 
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12. (original) The computerized system of Claim 11, wherein the locator 
component locates electronic pages on the Internet related to merchant, 
product and/or service identified of the structured data in the source 

5 database. 

13. (original) The computerized system of Claim 12, wherein the locator 
component includes a spider module that searches for embedded links, 
keywords and/or references in the text found at the located electronic 

10 pages. 

14. (original) The computerized system of Claim 12, wherein the locator 
component retrieves the natural language text from the located electronic 
pages. 

15 15. (original) The computerized system of Claim 14, wherein the 
processing of reference data in the reference database is accomplished 
by reducing the natural language text to a set of weighted keywords. 

16. (original) The computerized system of Claim 12, wherein the locator 
20 component validates the located electronic pages using zip code and/or 

Standard Industry Code (SIC) information stored in the source database. 

17. (original) The computerized system of Claim 1, wherein the predictive 
modeling module uses one or more of the following methodologies: 

25 model-based regression, non-parametria regression (e.g., neural 

networks), Bayesian inference, hidden Markov models, fuzzy logic 
models, evolutionary models, or decision trees. 

18. (original) The computerized system of Claim 1, wherein the source 
30 database comprises account based transactional records and the analyzer 
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component aggregates the data from the source database and its 
associated reference data by reference to an account field. 



19. (original) The system as defined in Claim 18, wherein the association 
5 of unstructured data from the reference database is delivered through a 

predictivie statistical model built from known historic outcomes associated 
with records within the source database. 

20. (currently amended)A computerized system for augmenting data from a 
10 source database with data from a reference database to generate an 



augmented database that can be used for predictive modeling, 
comprising: 

a source database comprising a plurality of transaction data records with 

each transaction data record having at least one field identifying a 
15 merchant, product and/or service; 

a merchant identifier database comprising a plurality of reference 

addresses and value description identifiers for merchants, products 

and/or services; 
a reference database; 
20 an address locating module configured to search the reference database 

to locate references for merchants, products and/or services 

identified in the source database; 
an account description database; 

a transaction augmentation module, configured to attach the value 
25 description of a particular merchant, product and/or service to the 

transaction data records and store the resulting combined record in 
the account description database; and 
a merchant analysis builder module configured to condense the 
references provided by the address locating module into a value 
30 description and store the value description in the merchant identifier 

database. 

5 
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21. (original) The system of Claim 20, further comprising an account 
descriptor builder module configured to generate descriptive account 
records from the merchant identifier database and the source database. 

5 

22. (original) The system of Claim 20, further comprising a lexicographic 
database configured to index value description identifiers to keywords. 

23. (original) The system of Claim 20, wherein the reference database 
10 comprises the Internet. 

24. (original) The system of. Claim 20, further comprising a predictive 
modeling module configured to predict future behavior of accounts, 
merchants, or other entities, using data from the account description 

15 database. 

25. (original) The system of Claim 20, further comprising a data mining 
search engine configured to conduct keyword searches of the account 
description database to identify accounts, merchants, or products. 

20 26. (currently amended)A computerized method of augmenting data from a 
source database with data from a reference database, the method 
. comprising: 

retrieving at least one data record recording an event from the source 
database; 

25 identifying a field in the data record that specifies an entity; 

locating reference data from the reference database that describes the 

entity specified by the entity field; 
processing the reference data to form a set of keyword descriptors 

describing the entity; 

30 augmenting the data record with the keyword descriptors to generate an 

augmented data record describing the entity; 



6 
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building an account descriptor database that includes at least one data 
record that correlates the at least one event with the description of 
the entity from the augmented data record; an4 

searching the account descriptor database for selected data records that 
meet a desired criteria ; and 

storing said selected data records for further processing . 

27. (original) The method of Claim 26, wherein the locating reference data 
includes locating data in an unstructured database. 

28. (original) The method of Claim 26, wherein the reference database 
includes at least a portion of the Internet. 



10 



29. (original) The method of Claim 28, wherein the locating reference data 
15 includes locating electronic pages using the entity specified in the at least 

one data record. 

30. (original) The method of Claim 29, wherein locating reference data 
further spidering for additional electronic pages cited within the located 

20 electronic pages. 

31. (original) The method of Claim 26, wherein locating reference data 
includes reducing natural language text to keyword descriptors. 

32. (original) The method of Claim 26, further comprising validating the 
25 located reference data using data from the at least one data record. 

33. (original) The method of Claim 26, further comprising storing the 
augmented data record in a merchant database. 
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34. (currently amended)A method of augmenting structured data stored in a 
source database with unstructured data stored in a reference database, 
comprising: 

reading a data record from the source database; 
5 searching the reference database for information describing the data 

record; 

condensing the information, describing the data record into at least one 

keyword description; 
augmenting the data record with the keyword description ; and 
10 storing said augmented data record . 

35. (original) The method of Claim 34, wherein the reference database 
comprises the Internet. 

15 36. (original) The method of Claim 35, wherein the data record contains at 
least a merchant name or identifier. 

37. (original) The method of Claim 36, wherein searching the reference 
database further includes locating electronic pages related to the 

20 merchant identified in the data record. 

38. (original) The method of Claim 37, wherein searching the reference 
database further includes retrieving the natural language text from the 
located electronic pages. 

• 25 

39. (original) The method of Claim 38 t wherein condensing the 
information comprises reducing the natural language text to at least one 
weighted keyword. 
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40. (currently amended)A computerized system associating unstructured data 
in a reference database with structured data in a source database, 
comprising: 

means for reading a data record from the source database; 
5 means for searching the reference database for information describing the 

data record; 

means for condensing the information describing the data record into at 

least one keyword description; aed 
means for augmenting the data record with the keyword description ; and 
10 means for storing said augmented data record . 

41. (original) The computerized system of Claim 40, wherein the 
reference database comprises the Internet. 

15 42. (original) The computerized system of Claim 41, wherein the data 
record contains at least a merchant name or identifier. 

43. (original) The computerized system of Claim 42, wherein the means 
for searching the reference database includes means for locating 

20 electronic pages related to the merchant identified in the data record. 

44. (withdrawn) 

45. (withdrawn) 

25 

46. (previously presented) A computerized system for augmenting data 
from a source database with data from a reference database to generate 
an augmented database that can be used for predictive modeling, 
comprising:. 

30 a source database comprising structured data; 

a reference database having reference data; 
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a locator component configured to use the structured data to locate 

reference data in the reference database suitable for association 

with the source database; and 
an analyzer component configured to process the reference data into a set 
5 of descriptors , to store the set of descriptors, and associat i ng 

associate the descriptors to the source data to form an augmented 

database. 

47. (previously presented) The computerized system of Claim 46, further 
1 0 including a predictive modeling component configured to classify behavior 

with the augmented database. 

48. (original) The computerized system of Claim 46, further including a 
data mining component configured to conduct searches of data in the 

15 augmented database. 

49. (original) The computerized system of Claim 46, wherein the source 
database contains financial transaction data: 

20 50. (original) The computerized system of Claim 47, wherein the 
structured data comprises at least a name or identifier corresponding to a 
merchant, product and/or service. 

51. (original) The computerized system of Claim. 46, wherein the 
25 reference database contains data in an unstructured format. 

52. (original) The computerized system of Claim 51 , wherein the 
reference database comprises a public database such as the Internet. 

30 53. (original) The computerized system of Claim 52, wherein the locator 
component locates electronic pages on the Internet related to merchant, 

10. 
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product and/or service identified of the structured data in the source 
database. 

54. (original) The computerized system of Claim 53, wherein the locator 
5 component includes a spider module that searches for embedded links, 

keywords and/or references in the text found at the located electronic 
pages. 

55. (original) The computerized system of Claim 53, wherein the locator 
component retrieves the natural language text from the located electronic 

10 pages. 

56. (original) The computerized system of Claim 55, wherein the 
processing of reference data in the reference database is accomplished 
by reducing the natural language text to a set of weighted keywords. 

15 

57. (original) The computerized system of Claim 53, wherein the locator 
component validates the located electronic pages using zip code and/or 
Standard Industry Code (SIC) information stored in the source database. 

20 58. (original) The computerized system of Claim 46, wherein the 
predictive modeling module uses one or more of the following 
methodologies: model-based regression, non-parametric regression (e.g., 
neural networks), Bayesian inference, hidden Markov models, fuzzy logic 
models, evolutionary models, or decision trees. 



25 



59. (original) The computerized system of Claim 46, wherein the source 
database comprises account based transactional records and the analyzer 
component aggregates the data from the source database and its 
associated reference data by reference to an account field. ' 



30 
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60. (original) The system as defined in Claim 59, wherein the association 
... of unstructured data from the reference database is delivered through a 
predictive statistical model built from known historic outcomes associated 
with records within the source database. 
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